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Abstract 

Intrinsically disordered proteins (IDPs) do not possess well-defined three-dimensional structures in solution under 
physiological conditions. We develop all-atom, united-atom, and coarse-grained Langevin dynamics simulations for 
the IDP a-synuclein that include geometric, attractive hydrophobic, and screened electrostatic interactions and are 
calibrated to the inter-residue separations measured in recent smFRET experiments. We find that a-synuclein is 
disordered with conformational statistics that are intermediate between random walk and collapsed globule behavior. 
An advantage of calibrated molecular simulations over constraint methods is that physical forces act on all residues, not 
only on residue pairs that are monitored experimentally, and these simulations can be used to study oligomerization 
and aggregation of multiple a-synuclein proteins that may precede amyloid formation. 



1 Introduction 

Intrinsically disordered proteins (IDPs) do not pos- 
sess well-defined three-dimensional structures in 
physiological conditions. Instead, IDPs can range 
from collapsed globules to extended chains with 
highly fluctuating conformations in aqueous solu- 
tion [T]. IDPs play a significant role in cellular 
signaling and control since they can interact with 
a wide variety of binding targets [2J. In addition, 
their propensity to aggregate to form oligomers and 
fibers has been linked to the onset of amyloid dis- 
eases The conformational and dynamic hetero- 
geneity of IDPs makes their structural characteriza- 
tion by traditional biophysical approaches challeng- 
ing. Also, force fields employed in all-atom molecular 
dynamics simulations, which are typically calibrated 
for folded proteins, can yield results that differ sig- 
nificantly from experiments (4]. 

In this manuscript, we focus on the IDP a-synuclein, 
which is a 140-residue neuronal protein linked to 
Parkinson's disease and Lewy body dimentia [5]. Pre- 
vious NMR studies have found that a-synuclein is 
largely unfolded in solution, but more compact than 



a random coil with same length |H El [7] . The pre- 
cise mechanism for aggregation in a-synuclein has not 
been identified, although it is known that aggregation 
is enhanced at low pH [5J [71 [S] , possibly due to the 
loss of long-range contacts between the N- and C- 
termini of the protein j 10] . 

Quantitative structural information has been ob- 
tained for a-synuclein using single-molecule fluores- 
cence resonance energy transfer (smFRET) between 
more than twelve donor and acceptor pairs 
These experimental studies have measured inter- 
residue separations for both the neutral and low pH 
ensembles. Prior studies have implemented the inter- 
residue separations from smFRET as constraints in 
Monte Carlo simulations with only geometric (e.g. 
bond-length and bond-angle) and repulsive Lennard- 
Jones interactions to investigate the natively disor- 
dered ensemble of conformations for monomeric a- 
synuclcin [12 . In contrast, we develop all-atom, 
united-atom, and coarse-grained Langevin dynamics 
simulations of a-synuclein that include geometric, at- 
tractive hydrophobic, and screened electrostatic in- 
teractions. The simulations are calibrated to closely 
match the inter-residue separations from the sm- 
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FRET experiments. An advantage of this method 
over constrained simulations is that physical forces, 
which act on all residues in the protein, are tuned so 
that the inter-residue separations from experiments 
and simulations agree. In future studies, we will em- 
ploy these calibrated Langevin dynamics simulations 
to study oligomerization and aggregation of multiple 
a-synuclein proteins over a range of solvent condi- 
tions. 



2 Methods 

The 140-residue IDP a-synuclein includes a nega- 
tively charged N-terminal region, hydrophobic cen- 
tral region, and positively charged C-terminal region 
(Fig. [IJ at neutral pH. We study three models for 
a-synuclein with different levels of geometric com- 
plexity: a) all-atom, b) united-atom, and c) coarse- 
grained, as shown in Fig. 2. 



bond angles, and dihedral angles were below 0.05 A 
and 0.008 rad, respectively. These rms values occur 
in the protein crystal structures from the Dunbrack 
database. Note that no explicit interaction potentials 
were used to constrain the backbone dihedral angles <f> 
and if) and side-chain dihedral angles. However, the 
bond lengths, bond angles, and sizes of the atoms 
were were calibrated so that they take on physical 
values. (See Appendix [A]) 

We included three types of interactions between non- 
bonded atoms: 1) the purely repulsive Lennard- 
Jones potential V r to model steric interactions, 2) 
attractive Lennard- Jones interactions V a between 
C a atoms on each residue to model hydrophobic- 
ity, and 3) screened electrostatic interactions V es be- 
tween atoms in the charged residues LYS, ARG, HIS, 
ASP, and GLU. Thus, the total interaction energy is 
V = V U + V ba + V da + V r + V a + V es . (See Fig. ||) 
The purely repulsive Lennard-Jones potential is 



All-Atom Model 

The all-atom model (including hydrogen atoms) 
matches closely the geometric properties of proteins. 
The average bond lengths (hj), bond angles (9ijk), 
and backbone dihedral angle uj between atoms C a - 
C-N-C a on successive residues were obtained from 
the Dunbrack database of 850 high-resolution pro- 
tein crystal structures |13| . The 242 distinct bonds 
and 440 distinct bond angles in a-synuclein were fixed 
using the following spring potentials: 
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where ki is the bond-length stiffness and is the 
center-to-center separation between bonded atoms i 
and j, and 
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where kg is the bond-angle stiffness and 6^ is the 
angle between bonded atoms i, j, and k. The aver- 
age backbone dihedral angle between the C a -C-N-C a 
atoms was constrained to zero using 
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We chose ki — 5 x 10 3 k o T /A 2 and kg = k u = 
2 x 10 5 /fcbTo/rad 2 (with T Q = 293K) so that the root- 
mean-square (rms) fluctuations in the bond lengths, 
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where (x) is the Heaviside step function that sets 



V r = for 
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a-j = (cr[ + (Tj)/2 is the average diameter of atoms 
i and j. We used the atom sizes (for hydrogen, car- 
bon, oxygen, nitrogen, and sulfur) from Ref. (T3] af- 
ter verifying that the backbone dihedral angles for 
the all-atom model sample the sterically allowed <f> 
and if) values in the Ramachandran map [15 when 
V = V U + V ba + V da + V r . (See Appendix^) 

The hydrophobic interactions between residues were 
modeled using the attractive Lennard-Jones potential 
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where e is the attraction strength, i?^ is the center- 
to-center separation between C a atoms on residues i 
and j, 

Ay = y/hihj, (6) 

hi is the hydrophobicity index for residue i that 
ranges from (hydrophilic) to 1 (hydrophobic) in 
Table [l] and a a w 4.8 A is the typical separation 
between centers of mass of neighboring residues. We 
find that the results for the conformational statistics 
for a-synuclein are not sensitive to small changes in 
<j a and hi (Appendix [bI . 
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MET ASP VAL PHE MET LYS GLY LEU SER LYS ALA LYS GLU GLY VAL VAL ALA ALA ALA GLU 20 

LYS THR LYS GLN GLY VAL ALA GLU ALA ALA GLY LYS THR LYS GLU GLY VAL LEU TYR VAL 40 

GLY SER LYS THR LYS GLU GLY VAL VAL HIS GLY VAL ALA THR VAL ALA GLU LYS THR LYS 60 

GLU GLN VAL THR ASN VAL GLY GLY ALA VAL VAL THR GLY VAL THR ALA VAL ALA GLN LYS 80 

THR VAL GLU GLY ALA GLY SER ILE ALA ALA ALA THR GLY PHE VAL LYS LYS ASP GLN LEU 100 

GLY LYS ASN GLU GLU GLY ALA PRO GLN GLU GLY ILE LEU GLU ASP NET PRO VAL ASP PRO 120 

ASP ASN GLU ALA TYR GLU MET PRO SER GLU GLU GLY TYR GLN ASP TYR GLU PRO GLU ALA 140 

Fig. 1: The three main regions of the 140-residue protein a-synuclein. Residues 1-60 form the highly basic 
N-terminal region (bold, blue), residues 61-95 form the hydrophobic central region (plain text), and 
residues 96-140 form the acidic C-terminal region (italics, red) |10l [TT] . 




Fig. 2: Snapshots of the (left) all-atom, (center) united-atom, and (right) coarse-grained representations of 
a-synuclein from Langevin dynamics simulations at temperature Tq — 293K, pH 7.4, and ratio of 
hydrophobic to electrostatic interactions a = 1.1. For the all-atom and united-atom models, hy- 
drogen, carbon, oxygen, nitrogen, and sulfur atoms are colored white, cyan, red, blue, and yellow, 
respectively. For the coarse-grained model, each blue-shaded monomer represents an amino acid. 
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The screened Coulomb potential was used to model 
the electrostatic interactions between atoms i and j 
for a-synuclein in water: 

i] ■> 

where e is the fundamental charge, e es = a a e 2 /4ireoe, 
eo is the vacuum permittivity, e = 80 is the permit- 
tivity of water, and £ = 9 A is the Coulomb screening 
length in an aqueous solution with a 150mM salt con- 
centration. The partial charge qi on atom i in one of 
the charged residues LYS, ARG, HIS, ASP, and GLU 
is given in Table [2] 



United-Atom Model 

For the united-atom model, we do not explicitly 
model the hydrogen atoms. Instead, we use a set 
of 11 atom sizes a\ from Ref. |18| . where the hydro- 
gens are subsumed into the heavy atoms: C (c[/2 = 
1.53 A), CH (1.80 A), CH 2 (1.80 A), CH 3 (1.80 A), 
O (1.26 A), OH (1.44 A), N (1.53 A), NH 2 (1.57 A), 
NH 3 (1.80 A), and S (1.62 A). We optimized the atom 
sizes by characterizing the backbone dihedral angles 
<j) and ip as a function of ct[ in the united-atom simu- 
lations with V = V bl + V ba + V da + V r . The (j) and ip 
backbone dihedral angle distributions closely match 
that from the Ramachandran map (i.e. the a-helix 
and /3-sheet regions) when we scale the atom sizes in 
Ref. [H] by 0.9 as shown in Appendix [A] Otherwise, 
the all-atom and united-atom models use the same 
interaction potentials in Eqs.[T][7] 

Coarse-Grained Model 

For the coarse-grained model, we employed a 
backbone-only C a representation of a-synuclein 
where each residue i is represented by a spherical 
monomer i with size a a , mass M, hydrophobicity hi, 
and charge Qi- The average bond length between 
monomers i and j was fixed to (Zy) = 4.0 A, which is 
the average separation between C a atoms on neigh- 
boring residues, using Eq. [TJ (with nj replaced by 
Rij). The bond-angle O (between three successive C a 
atoms) and dihedral-angle <I> (between four successive 
C a atoms) potentials were calculated so that the O 
and $ distributions matched those from the united- 
atom simulations with V = V bl + V ba +V da + V r . The 
O distributions from the united-atom model were ap- 
proximately Gaussian with mean (O) = 2.13 rad and 
standard deviation <jq — 0.345 rad. 



The dihedral angle potential V for the coarse- 
grained simulations was obtained by fitting the dis- 
tribution P($) from the united-atom simulations to 
a seventh-order Fourier series 

6 

y da ($) = £ a fc cos (fc$) + b k sin (k<S>) , 

k=Q 

where a k = -2k b T (cos logP ($)), b k = 
— 2fc & T (sin log P (<&)), and the angle brackets 
indicate an average over time and dihedral angles 
along the protein backbone. 

For steric interactions between residues, we used the 
purely repulsive Lennard-Jones potential in Eq. ^] 
with and Oy replaced by Rij and a a respectively. 
The hydrophobic interactions are the same as those 
in Eqs. [5] and [6] with e a — e r . The electrostatic inter- 
actions between residues are given by Eq. [7] with qi 
and rij replaced by Qi and Rij, respectively. 

Langevin Dynamics 

The all-atom, united-atom, and coarse-grained mod- 
els were simulated at fixed NVT using a Langevin 
thermostat [TI5], modified velocity Verlet integration 
scheme, and free boundary conditions. We set the 
time step At — 10~ 2 £o and damping coefficient 

7 = lO -3 *^ 1 , where to = y l rn(a^) / 'e r and m is 
the hydrogen mass for the all-atom and united-atom 
models and to = J Ma a /e r for the coarse-grained 
model. The initial atomic positions were obtained 
from a micelle-bound NMR structure (protein data 
bank identifier 1XQ8) for a-synuclein at pH 7.4 and 
temperature 298K [5U]. The initial positions for the 
coarse-grained model were obtained from simulations 
at high temperature with only bond-length, bond- 
angle, and dihedral-angle constraints and repulsive 
Lennard-Jones interactions. The simulations were 
run for times much longer than the characteristic re- 
laxation time from the decay of the radius of gyration 
autocorrelation function. 

In the results below, we will study the radius of gyra- 
tion R g and distribution of inter-residue separations 
P(Rij) as a function of the ratio of the attractive hy- 
drophobic and electrostatic energy scales a — e a /e es 
and quantitatively compare the results from smFRET 
experiments and all-atom, united-atom, and coarse- 
grained simulations. 

3 Results 

In Fig. [4j we show the radius of gyration that char- 
acterizes the overall protein shape for the all-atom, 
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Fig. 3: Schematics of (a) the purely repulsive Lennard- Jones potential V r in Eq. |4j (solid line), (b) attractive 
Lennard- Jones potential V a in Eq. [5] (solid line), and (c) screened Coulomb potential V es in Eq. [7] 
(solid line). The dashed line in (b) represents repulsive Lennard- Jones interactions between residues 
i and j in the coarse-grained model. 



ALA 


ARC 


ASN 


ASP 


CYS 


GLN 


GLU 


GLY 


HIS 


ILE 


0.735 


0.37 


0.295 


0.41 


0.76 


0.41 


0.54 


0.5 


0.29 


1 




LEU 


LYS 


MET 


PHE 


PRO 


SER 


THR 


TRP 


TYR 


VAL 


0.985 


0.385 


0.87 


1 


0.27 0.475 0.565 0.985 0.815 0.88 



Tab. 1: Hydrophobicity indices hi that range from (hydrophilic) to 1 (hydrophobic) for residues in a- 
synuclein at pH 7.4 |16| . 



Residue 


Atom 


Atom Charge qi 


Residue Charge Qi 


LYS 




1 


1 


ARG 




0.39 


1 


N, 2 


0.39 


N, 


0.22 


HIS 


N dl 


0.05 


0.1 


N e2 


0.05 


ASP 


C-ai 


-0.5 


-1 


62 


-0.5 


GLU 


Qel 


-0.5 


-1 




-0.5 



Tab. 2: Partial charges q^ on atom i (left) and total charge Qi on residue i (right) for the charged residues 
LYS, ARG, HIS, ASP, and GLU at pH 7.4 [T7]. The total partial charge q = qi for the N-terminal, 
central, and C-terminal regions are 4.1, — 1, and —12.0, respectively. 
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Fig. 4: Average radius of gyration (R g ) versus the ratio of the attractive hydrophobic to electrostatic inter- 
actions a for the coarse-grained (black solid) , united-atom (red dashed) , and all-atom (green dotted) 
models at To (or the temperature that gives R g « 33 A in the coarse-grained simulations) and pH 
7.4. The horizontal line and gray shaded region indicate the average and standard deviation over 
recent NMR, SAXS, and smFRET experimental measurements, (R g ) — 33.0 ± 7.7 A, for monomeric 
a-synuclein near T and neutral pH |j| 17] \S\ H2 1 I2" H 1^1 ■ 




residue pairs residue pairs 



Fig. 5: A comparison of FRET efficiencies ET e Q for twelve residue pairs from simulations and experiments 
of a-synuclein. In (a) the data includes FRET efficiencies from united-atom simulations of a random 
walk (red dashed) , collapsed globule (green dot-dot-dashed) , only electrostatic interactions at temper- 
ature To (blue dotted), and ratio of attractive hydrophobic to electrostatic interactions a = 1.1 (purple 
dot-dashed) at To and recent smFRET experiments [12] (black solid). Error bars from experiments 
were calculated using a resampling method that accounts for uncertainty in the determination of ET c g 
(1-2%) and variations in i?o (7-8%) due to the effects of the protein environment on the smFRET 
fluorophores. In (b) we compare the FRET efficiencies from recent smFRET experiments [T2J to the 
coarse-grained simulations of a random walk (red dashed) , collapsed globule (green dot-dot-dashed) , 
only electrostatics interactions (blue dotted), and both attractive hydrophobic and electrostatic in- 
teractions with a = 1.1 at a temperature that yields (R g ) « 33 A (purple dot-dashed), (c) The rms 
deviation A between the FRET efficiencies from the united-atom simulations and smFRET experi- 
ments (black solid) and the coarse-grained simulations and smFRET experiments (red dashed) versus 
a. The minimum rms A m i n 0.09 occurs near a ~ 1.1 for both the united-atom and coarse-grained 
simulations. 
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united-atom, and coarse-grained models, 
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where fi is the position of atom or monomer i, as 
a function of the ratio of the attractive hydropho- 
bic to electrostatic interactions a at temperature Tq 
and pH 7.4. For a 3> 1, the protein forms a col- 
lapsed globule with (R g ) « 12-15 A. Whereas for 
a<l, the models only include electrostatics inter- 
actions, and (Rg) is similar to the random walk values 
for the three models (all-atom: 42.8 A, united-atom: 
48.6 A, coarsc-gr ained: 48.2 A). The crossover be- 
tween random walk and collapsed globule behavior 
for (R g ) occurs near a ~ 1. 

A number of recent SAXS, NMR, and smFRET ex- 
periments have measured the radius of gyration for 
monomeric a-synuclein near To and neutral pH pfl [71 
EE [H HD 1221 121 HI- As shown in Fig. g the aver- 
age over these experimental measurements is (R g ) = 
33.0 ± 7.7 A, and thus the (R g ) for a-synuclein falls 
in between the random walk and collapsed globule 
values. 

We can more quantitatively compare simulation and 
experimental studies of a-synuclein by calculating 
the distributions of inter-residue distances or, equiv- 
alently, the FRET efficiencies. FRET efficiencies be- 
tween residues i and j are obtained from 
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where Rq = 54 A is the Forster distance for the fluo- 
rophore pair in Refs. [TT1 [25] and the angle brackets 
indicate an average over time. To calculate (Rij) from 
the FRET efficiencies, one must invert Eq. fusing the 
distribution of inter-residue separations P(Rij). 

The FRET efficiencies for the twelve residue pairs 
from recent smFRET experiments on a-synuclein [11 
and the united-atom and coarse-grained simulations 
are shown in Fig. [5] (a) and (b) . Errors in the inter- 
residue separation distributions can occur in both the 
directly measured ET e g values and Rq. To estimate 
the errors, we generated 10 decoy sets of inter-residue 
separations using ET c h and Rq values drawn from 
distributions accounting for the individual uncertain- 
ties. We then calculated the rms deviation over each 
decoy set assuming that we know Rq precisely. 

We identify several important features in the compar- 
ison of the FRET efficiencies from experiments and 
simulations in Fig. [5] (a) and (b): 1) The united-atom 



and coarse-grained models yield qualitatively similar 
results for the FRET efficiencies; 2) The FRET effi- 
ciencies for the random walk and pure electrostatics 
models are similar to each other and much lower than 
most of the residue pair FRET efficiencies from ex- 
periments; 3) The FRET efficiencies for the collapsed 
globule w 1 and do not match those from experi- 
ments; and 4) By tuning a, we are able to match 
quantitatively the FRET efficiencies from the exper- 
iments and simulations. 

As shown in Fig. [5] (c), the rms deviations A be- 
tween the FRET efficiencies from the united-atom 
simulations and smFRET experiments and between 
the FRET efficiencies from the coarse-grained sim- 
ulations and smFRET experiments are minimized 
when a « 1.1. For the united-atom model, a ~ 1.1 
gives (R g ) w 33 A, which is similar to that found 
in Ref. [12 . The largest deviations in the FRET ef- 
ficiencies between the united-atom simulations and 
smFRET experiments occur for small inter-residue 
separations, which are likely caused by the finite size 
of the dye molecules. Note that the deviations at 
small inter-residue separations are much weaker for 
the coarse-grained simulations. Thus, we find that it 
is crucial to include both electrostatic and attractive 
hydrophobic interactions in modeling a-synuclein in 
solution. 

For the coarse-grained simulations, we also studied 
the variation of the FRET efficiencies function 
of temperature (not only at T = Tq). In Fig. [6] 
we show the rms deviation between the FRET effi- 
ciencies for the coarse-grained simulations and sm- 
FRET experiments for the twelve residue pairs con- 
sidered in Ref. [12] as a function of a and kt,T/e r . We 
find that the line of a and k^T / e r values that give 
(R g ) — 33 A lies in the region where the rms devia- 
tions in the FRET efficiencies are minimized, which 
indicates that there is a class of polymeric structures 
with similar conformational statistics to that of a- 
synuclein. 

In Fig. [7] we compare the inter-residue separation 
distributions P(Rij) obtained from experimentally 
constrained Monte Carlo (ECMC) and united-atom 
(with a = 1.1) simulations. For the ECMC simu- 
lations discussed in detail in Ref. [T2], we assumed 
that P(Rij) was similar to that for a random walk 
C a model with only bond- length, bond-angle, and 
dihedral-angle (uS) constraints and repulsive Lennard- 
Jones interactions to obtain (Rij) from the experi- 
mentally measured FRET efficiencies. We find that 
(Rij) for the ECMC and united-atom simulations 
agree to within roug hly 10% (Fig. |] (left)), however, 
the standard deviations differ significantly, as shown 
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in Fig. [8] (right). The standard deviation of P(Rij) 
for the united-atom simulations is larger than that 
for the ECMC simulations for all residue pairs and 
scales as ctr ~ \i — j\ s with S ~ 0.6 (compared to 
the excluded volume random walk scaling exponent 
5 = 0.69). Further, an for residue pairs that are not 
constrained in ECMC do not obey the scaling behav- 
ior with i — j as found for residue pairs that were 
constrained (a R ~ \i — j\ s with S — 0.4 [12]). 



4 Conclusions and Future Directions 

We have shown that we are able to accurately model 
the conformational dynamics (i.e. the inter-residue 
separations) of the IDP a-synuclein at temperature 
To = 293K and neutral pH using all-atom, united- 
atom, and coarse-grained Langevin dynamics simu- 
lations. Our results show that the structure of a- 
synuclein is intermediate between that for random 
walks and collapsed globules with the rms separation 
an between residues i and j scaling as \i — j\ d with 

5 ~ 0.6. The calibrated Langevin dynamics simula- 
tions presented here have the advantage over con- 
straint methods in that physical forces act on all 
residues, not only on residue pairs that are mon- 
itored experimentally, and can be tuned to match 
FRET efficiencies from experiments. In future work, 
we will employ calibrated Langevin dynamics simu- 
lations to study the conformational dynamics of a- 
synuclein at low pH and the interaction and associa- 
tion between two or more a-synuclein monomers as a 
function of pH to identify mechanisms for a-synuclein 
oligomerization. In preliminary calibrated coarse- 
grained Langevin dynamics simulations, we find that 
two monomeric a-synuclein proteins only associate 
for sufficiently strong attractive hydrophobic interac- 
tions (a > 1.1), as shown in Fig. [9} 
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Fig. 6: RMS deviations A between the coarse-grained and experimental FRET efficiencies for the twelve 
residue pairs considered in Rcf.[12] as a function of a and k},T/e r . The dashed line indicates systems 
that give (R g ) — 33 A. Note that this line coincides with the minimum values for the rms deviations. 



54-72 
72-92 
9-33 
54-92 
92-130 
33-72 
9-54 
72-130 
9-72 
54-130 
33-130 
9-130 
1-140 
12-127 
14-125 
21-111 
21-118 
36-75 
38-77 
43-82 
57-133 
59-135 
63-139 




50 



100 



150 



50 



100 



150 



Fig. 7: Probability distributions for the inter-residue separations P(Rij) for the twelve residue pairs consid- 
ered in Ref. [T2| and eleven additional pairs for experimentally constrained Monte Carlo (ECMC) [T^] 
(left) and united-atom (with a = 1.1; right) simulations. The average inter-residue separations (Rij) 
for the united atom and ECMC simulations are shown with solid and dashed lines, respectively. 
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Fig. 8: 



Average (Rij 



and standard deviation aji of the inter-residue separation distributions in Fig.[7]for the 
united-atom (squares) and ECMC (triangles) simulations versus chemical distance between residues 
\i — j\. The filled symbols indicate residue pairs that were considered in smFRET experiments fT2"] 
and open symbols indicate other pairs. The solid and dashed lines have slopes 0.54 and 0.31 (left 
panel) and 0.62 and 0.38 (right panel), respectively. 




Fig. 9: Snapshots from preliminary aggregation studies of two monomeric a-synuclein proteins (dark green 
and light blue) using coarse-grained simulations with the temperature set so that (R g ) w 33 A at 
a = 1.1 (for individual protein monomers) for (a) a — 0.7 (b) 1.1, (c) 1.3, (d) 1.5, and (e) 1.8. 
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A Calibration of Atom Sizes 

In this Appendix, we test the choice of the atom sizes used in the all-atom and united-atom models by 



measuring the Ramachandran plot 1 15] for the backbone dihedral angles (f> and ijj. In Fig. 10 we show that 
the Ramachandran plot for the random walk all-atom model of a-synuclein with no attractive hydrophobic 
and electrostatic interactions and atom sizes from Ref. [2] closely resembles that for dipeptides with highly 
populated a-helix and /3-sheet regions. In Fig. [IT] we show the Ramachandran plots for the backbone 
dihedral angles <f> and -0 obtained from the random walk united-atom model of a-synuclein with no attractive 
hydrophobic and electrostatic interactions and atom sizes 0.8, 0.85, 0.9, 0.95, and 1.0 times those from 
Ref. [IS]. We find that the Ramachandran plot for united-atom model with a factor of 0.9 for the atom sizes 
is similar to that for the all-atom model. 
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Fig. 10: Ramachandran plot for the backbone dihedral angles <f> and ip obtained from the all-atom random 
walk simulations with no attractive hydrophobic and electrostatic interactions, and atom sizes given 
in Ref. [13]. The highly populated <j> and ip angles indicate /3-sheet (upper left) and a-helix (lower 
left) conformations. 
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Fig. 11: Ramachandran plot for the backbone dihedral angles <fi and ip obtained from the united-atom 
random walk simulations with no attractive hydrophobic and electrostatic interactions and atom 
sizes 0.8 (upper left), 0.85 (upper right), 0.9 (middle left), 0.95 (middle right), and 1.0 (bottom) 
times those given in Ref. |18| . 
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B Robustness of the Hydrophobic Interactions 



In this Appendix, we study the sensitivity of the FRET efficiencies for the united-atom simulations to small 
variations in the lengthscale <j a above which the attractive hydrophobic interactions are nonzero and relative 



strengths hi of the attractive hydrophobic interactions for different residues. In Fig. 12 (left), we show that 
the FRET efficiencies for the twelve residue pairs show only small variations with a a over the range from 
4.3 A to 5.2 A (except for 9-72 with a a = 4.3 A). In Fig. [l2] (right), we show that the FRET efficiencies for 
the twelve residue pairs are robust for Ah < 0.5. 
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Fig. 12: (left) FRET efficiencies ET e s for the twelve residue pairs considered in Ref. (X2] from smFRET 
experiments (upward triangles) and united-atom simulations with a set so that R„ w 33 A and 
cr a = 4.4 A (circles), 4.6 A (squares), 4.8 A (diamonds), 5.0 A (stars), and 5.2 A (pentagons), 
(right) FRET efficiencies ET e g for the twelve residue pairs considered in Ref. [H] for the united- 
atom simulations for a — 1.1 and varying hydrophobicity indices h! i = hi + Ah, where Ah is chosen 
from a zero-mean Gaussian distribution with standard deviation 0.0 (circles), 0.02 (squares), 0.05 
(diamonds), 0.05 (diamonds), 0.1 (stars), 0.3 (pentagons), 0.5 (hexagons). The average ET e s and 
its standard deviation for 32 samples are shown for each Ah. 
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